Language-identification based on cross-language acoustic models and optimised information combination
نویسندگان
چکیده
decoding, the second transforms the parameters from This work is concerned with the subject of languagethe decoding module and classifies the language. identification (LID). Two central issues are addressed. The common acoustic signal preprocessor calculates The first is to analyse the trade-off between detailed 12 RASTA filtered MFCC’s, their first derivatives and acoustic modelling and robust estimation of acoustic the delta-log-energy. The phone and language decoding and language models. The second to find the optimal module consists of three parallel branches. In each of combination of acoustic and language scores for languagethese the phone recogniser matches the acoustic identification. parameters to the acoustic models used by that recogniser. Experiments are carried out using the three languages The output from each recogniser is further matched American-English, German and Spanish from the OGI-TS against three language models. database. It is shown that on the average the acoustic The combined output X from all language models modelling is able to recognise 46.3% of the phones correctly and from all recognisers are used as input to the across the three languages. Insertion and deletion rate ‘information combination and the language-classification’ is 35.7% and 6.6%, respectively. Language-identification module (ICLC). This module enforces a transformation performance is 82.6% with the full set of acoustic models. onto the parameters X and estimates the most probable The performance is increased to 83.7% after having language given the acoustic input. conducted 80 iterations of a hierarchical clustering in which phones are merged across the languages.
منابع مشابه
Language identification incorporating lexical information
In this paper we explore the use of lexical information for language identification (LID). Our reference LID system uses language-dependent acoustic phone models and phone-based bigram language models. For each language, lexical information is introduced by augmenting the phone vocabulary with the N most frequent words in the training data. Combined phone and word bigram models are used to prov...
متن کاملAdvertising Keyword Suggestion Using Relevance-Based Language Models from Wikipedia Rich Articles
When emerging technologies such as Search Engine Marketing (SEM) face tasks that require human level intelligence, it is inevitable to use the knowledge repositories to endow the machine with the breadth of knowledge available to humans. Keyword suggestion for search engine advertising is an important problem for sponsored search and SEM that requires a goldmine repository of knowledge. A recen...
متن کاملAutomatic speech recognition of Cantones
This paper describes our recent work on the development of a largevocabulary, speaker-independent, continuous speech recognition system for Cantonese-English code-mixing utterances. The details of both acoustic modeling and language modeling will be discussed. For acoustic modeling, Cantonese accents in English words are handled by applying cross-lingual acoustic units, as well as modifications...
متن کاملA Study of the Relationship between Acoustic Features of “bæle” and the Paralinguistic Information
Language users benefit from special phonetic tools in order to communicate linguistic information as well as different emotional aspects and paralinguistic information through daily conversation. Having functions in conveying semantic information to listeners, prosodic features form the essential part of linguistic behavour, manipulating them potentially can play an important role in transmitt...
متن کاملThe Impact of Structured Input-based Tasks on L2 Learners’ Grammar Learning
Abstract Task-based language teaching has received increased attention in second language research. However, the combination of structured input-based approach and task-based language teaching has not been examined in relation to L2 grammar learning. To address this gap, the present study investigated how the structured input-based tasks with and without explicit information impacted learners’ ...
متن کامل